LISTING OF CLAIMS 



1. (currently amended) A method for a computer network user 
for creating a voice XML file automatically, comprising: 

providing a graphic user interface for defining a 
plurality of icons, each of said icons corresponding to one 
or more attributes of voice XML; 

receiving user selection input of said icons; 

recording an action stream of a user invoking said 
icons in the graphic user interface; and 

interpreting said action stream based on a library of 
voice XML tags to create the voice XML file. 

2. (original) The method according to claim 1, 
characterized in that said graphic user interface comprises 
a graphic user interface for adding one or more audio 
hyperlinks for a voice XML file automatically, wherein each 
icon, defined in said graphic user interface, corresponds to 
a kind of hyperlink. 

3. (original) The method according to claim 2, wherein 
said adding hyperlinks comprises adding the hyperlinks to a 
TTS voice XML file, and wherein said adding comprises the 
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user editing the TTS voice XML file in the edit area of said 
graphic user interface, marking or entering the parts to be 
added with the hyperlinks, invoking the corresponding icons 
and entering the corresponding hyperlink addresses. 

4. (original) The method according to claim 2, wherein said 
adding hyperlinks comprises adding the hyperlinks to a 
real-time-recorded audio voice XML stream, and wherein said 
adding comprises the user editing the TTS voice XML file in 
the edit area of said graphic user interface, marking or 
entering the parts to be added with the hyperlinks, invoking 
the corresponding icons and entering the corresponding 
hyperlink addresses, and wherein speech recognition 
technology is applied to find the parts in the 
real-time-recorded audio voice XML stream that match the 
parts entered by the user when interpreting said action 
stream based on a library of voice XML tags. 

5. (original) The method according to claim 3, 
characterized in that when the user marks or enters the same 
parts to be added with the hyperlinks in the edit area of 
the graphic user interface for many times and invokes the 
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same hyperlink attributes, the hyperlinks for the whole TTS 
voice XML stream are batch-added. 

6. (original) The method according to claim 4, 
characterized in that when user marks or enters the same 
parts to be added with the hyperlinks in the edit area of 
the graphic user interface for many times and invokes the 
same hyperlink attributes, the hyperlinks for the whole 
real-time-recorded audio voice XML stream are batch-added. 

7. (currently amended) A system for creating voice XML 
file automatically, comprising: 

a graphic user interface for defining a plurality of 
icons based on network user input , wherein each of said 
icons corresponds to one or more attributes of voice XML; 

a voice XML tag generator for interpreting said action 
stream based on a library of voice XML tags and generating 
the corresponding voice XML tags; and 

a voice XML file generator for creating the voice XML 
file by combining the contents to be played with the tags 
generated by the voice XML tag generator according to voice 
XML syntax. 
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8. (original) A system according to claim 7, characterized 
in that said graphic user interface comprise a graphic user 
interface component for adding audio hyperlinks for VoiceXML 
file automatically, wherein each icon, defined in said 
graphic user interface, corresponds to a kind of hyperlink. 

9. (original) A system according to claim 8, wherein said 
adding the hyperlinks comprises adding hyperlinks for a TTS 
voice XML stream, and wherein said adding comprises the user 
editing the TTS voice XML file in the edit area of said 
graphic user interface, marking or typing the parts to be 
added the hyperlinks, invoking the corresponding icons and 
typing the corresponding hyperlink addresses. 

10. (original) A system according to claim 8, wherein said 
adding comprises adding the hyperlinks for real-time 
recorded audio voice XML stream and wherein said system 
further comprises a speech recognition engine, said adding 
comprising the user editing the TTS voice XML file in the 
edit area of said graphic user interface, marking or typing 
the parts to be added the hyperlinks, invoking the 
corresponding icons and typing the corresponding hyperlink 
addresses, and wherein said interpreting said action stream 
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based on a library of voice XML tags further comprises said 
speech recognition engine finding the parts in the 
real-time-recorded audio Voice XML stream that match the 
parts entered by the user. 

11. (original) A system according to claim 9, characterized 
in that when the user marks or enters the same parts to be 
added as hyperlinks in the edit area of the graphic user 
interface component for many times, and invokes the same 
hyperlinking attributes, said component adds the hyperlinks 
for the whole TTS voice XML stream. 

12. (original) A system according to claim 10, 
characterized in that when user marks or enters the same 
parts to be added as hyperlinks in the edit area of the 
graphic user interface component for many times, and invokes 
the same hyperlinking attributes, said component adds the 
hyperlinks for the whole real-time-recorded audio voice XML 
stream. 

13. (currently amended) A program storage device readable 
by machine tangibly embodying a program of instructions 
executable by said machine to perform method steps for 
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creating a voice XML file automatically, said method 
comprising the steps of: 

providing a graphic user interface for defining a 
plurality of icons based on network user input , each of said 
icons corresponds to one or more attributes of voice XML; 

recording an action stream of a user invoking said 
icons in the graphic user interface; and 

interpreting said action stream based on a library of 
voice XML tags to create the voice XML file. 

14. (original) The program storage device according to claim 
13 wherein said method is characterized in that said graphic 
user interface comprises a graphic user interface for adding 
one or more audio hyperlinks for a voice XML file 
automatically, and wherein each icon, defined in said 
graphic user interface, corresponds to a kind of hyperlink. 

15. (original) The program storage device according to claim 
14, wherein said adding hyperlinks comprises adding the 
hyperlinks to a TTS voice XML file, and wherein said adding 
comprises the user editing the TTS voice XML file in the 
edit area of said graphic user interface, marking or 
entering the parts to be added with the hyperlinks, invoking 
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the corresponding icons and entering the corresponding 
hyperlink addresses. 

16. (original) The program storage device according to 
claim 14, wherein said adding hyperlinks comprises adding 
the hyperlinks to a real-time-recorded audio voice XML 
stream, and wherein said adding comprises the user editing 
the TTS voice XML file in the edit area of said graphic user 
interface, marking or entering the parts to be added with 
the hyperlinks, invoking the corresponding icons and 
entering the corresponding hyperlink addresses, and wherein 
speech recognition technology is applied to find the parts 
in the real-time-recorded audio voice XML stream that match 
the parts entered by the user when interpreting said action 
stream based on a library of voice XML tags. 

17. (original) The program storage device according to claim 
15, characterized in that when the user marks or enters the 
same parts to be added with the hyperlinks in the edit area 
of the graphic user interface for many times and invokes the 
same hyperlink attributes, the hyperlinks for the whole TTS 
voice XML stream are batch-added. 
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18. (original) The program storage device according to claim 
16, characterized in that when user marks or enters the same 
parts to be added with the hyperlinks in the edit area of 
the graphic user interface for many times and invokes the 
same hyperlink attributes, the hyperlinks for the whole 
real-time-recorded audio voice XML stream are batch-added. 
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